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ABSTRACT 

We show how observations of the evolution of the galaxy cluster number abundance can be used 
to constrain primordial non-Gaussianity in the universe. We carry out a maximum likelihood analysis 
incorporating a number of current datasets and accounting for a wide range of sources of systematic error. 
Under the assumption of Gaussianity, the current data prefer a universe with matter density O m ~ 0.3 
and are inconsistent with Cl m = 1 at the 2a level. If we assume Q m — 1, the predicted degree of cluster 
evolution is consistent with the data for non-Gaussian models where the primordial fluctuations have 
at least two times as many peaks of height 3<r or more as a Gaussian distribution does. These results 
are robust to almost all sources of systematic error considered: in particular, the O m = 1 Gaussian case 
can only be reconciled with the data if a number of systematic effects conspire to modify the analysis in 
the right direction. Given an independent measurement of $7 m , the techniques described here represent 
a powerful tool with which to constrain non-Gaussianity in the primordial universe, independent of 
specific details of the non-Gaussian physics. We discuss the prospects and strategies for improving the 
constraints with future observations. 



1. INTRODUCTION 

Most studies of structure formation in the universe 
start from the assumption that the primordial pertur- 
bations were Gaussian. Many well motivated mod- 
els, however, predict non-Gaussian fluctuations, includ- 
ing topological defect models (Kibble 1976; Vilenkin & 
Shellard 1994), and certain forms of inflation (Peebles 
1983,1997,1998a,1998b; La 1991; Amendola & Occhionero 
1993; Amendola & Borgani 1994). One area which has 
recently received a considerable degree of attention in the 
Gaussian case is the evolution of the galaxy cluster num- 
ber abundance. Several authors (Frenk et al. 1990; Oukbir 
& Blanchard 1992; Oukbir & Blanchard 1997; Fan, Bah- 
call & Cen 1997; Gross et al. 1998; Blanchard & Bartlett 
1998; Eke et al. 1998; Reichart et al. 1999; Viana & Lid- 
die 1999) have shown how cluster evolution data can be 
used to constrain the matter density of the universe under 
the assumption of Gaussianity, with most studies favoring 
a low density universe. 

Some attempts have been made to extend studies of 
cluster formation to the non-Gaussian case. Oukbir, 
Bartlett & Blanchard (1997) noted the existence of a de- 
generacy between the spectral index of the primordial fluc- 
tuations and primordial non-Gaussianity. Colafrancesco, 
Lucchin & Matarrese (1989), and later Chiu, Ostriker & 
Strauss (1997) have introduced a modified version of the 
Press-Schechter (1974) formalism to make predictions for 
the cluster number density in a non-Gaussian model, and 
Robinson, Gawiser and Silk (1998, hereafter RGS98) have 
generalized this approach. Robinson & Baker (2000, here- 



after RB00) have tested this formalism and verified that it 
is able to accurately fit the evolution of the cluster num- 
ber abundance observed in N-body simulations of struc- 
ture formation with non-Gaussian initial conditions. Var- 
ious authors (RGS98; van de Bruck 1998; Koyama, Soda 
&: Taruya 1999) have used this formalism to study the 
evolution and clustering properties of galaxy clusters and 
thus constrain non-Gaussianity in the primordial fluctu- 
ations. Willick (1999) has considered the constraints on 
non-Gaussianity which can be derived from the existence 
of cluster MS1054-03 at a redshift z = 0.83. In this paper, 
we extend previous work by carrying out a detailed study 
of cluster evolution in the non-Gaussian case, performing 
a Bayesian likelihood analysis, considering a number of 
different cluster observations, and accounting for a wide 
range of possible sources of systematic error. 

Our results demonstrate that given an independent mea- 
surement of the matter density of the universe (which we 
can realistically expect to gain in the near future from 
CMB and supernovae observations) , cluster evolution data 
can place strong constraints on non-Gaussianity. The 
greatest strength of our analysis is that it uses observations 
to place direct constraints on non-Gaussianity (in partic- 
ular, on the probability distribution function, or PDF, of 
the primordial density field) , without any reference to the 
details of the non-Gaussian physics in specific models. Be- 
cause of the infinite range of possible non-Gaussian models 
and the difficulties in making accurate predictions even in 
well specified cases, the model independent constraints on 
non-Gaussianity considered here are therefore particularly 
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powerful. In section we discuss the parameterization of 
our models and the process of predicting the cluster num- 
ber density at different redshifts. In section |3] we discuss 
the different datasets used in our analysis and our method 
for computing the likelihood of observing a dataset given 
one of our models. In section we discuss the resulting 
constraints on non-Gaussianity from existing cluster data, 
the likely sources of systematic error, and the improve- 
ment we can hope to gain from future data. In section |5j 
we draw our conclusions. 



2. MODELS 

The models we consider are specified by knowledge of 
the power spectrum of the fluctuations, the nature of the 
non-Gaussianity, the background cosmology, and the clus- 
ter mass-temperature relationship. We discuss the param- 
eterization of these properties in the following subsections. 



2.1. Power Spectrum 

We parameterize the power spectrum P(k) for each 
model using a cold dark matter (CDM) form (Bond et 
al. 1991), that is 

P(k) ex kT 2 {k) (1) 

where 



T(k) = 



ln(l + 2.34q) 
2.34 9 

[l + l.389q+{16.lq) 2 



(5.46g) 3 + (6.71q) 4 



-1/4 



and 



T/iMpc" 



(2) 



(3) 



Here T is the shape parameter (which is well fit by T = 
Vt m h exp(f^B — rie/^m) for both open and flat CDM mod- 
els, where J7 m is the contribution of CDM and baryons 
to the critical density at z = 0, FIb is the contribution 
of the baryons, and the Hubble constant is parameterized 
via Ho = 100/ikm s~ 1 Mpc~ 1 ). Although this form is de- 
rived within the context of the CDM model, it can also 
be used to fit power spectra from a range of structure for- 
mation scenarios, at least over the range of scales relevant 
to cluster formation. As we will see later, our results are 
quite insensitive to the value of T, suggesting that a more 
precise parameterization of the power spectrum (for in- 
stance, allowing for the possibility of a primordial tilt) is 
not necessary. We quantify the normalization of the power 
spectrum by specifying <t 8 , the rms density fluctuation in 
spheres of radius 8/i _1 Mpc. 

2.2. Non-Gaussianity 

Non-Gaussianity is specified in terms of the probabil- 
ity distribution function (PDF) pr(6) of the primordial 
fluctuations, where pr(S) dd is the probability of the over- 
density (linearly extrapolated to a redshift z = 0) in a 
sphere of radius R having a value between S and S + d5. 
For most non-Gaussian models, it is reasonable to assume 
that the PDF is scale invariant over the range of scales 



relevant to cluster formation (see RB00 for specific exam- 
ples). In this case, we can make use of a rescaled PDF, 
which we denote P(y), satisfying 



P(y)dy = —p R (S)dS 

O-R 



(1) 



where an is the rms fluctuation in spheres of radius R. 

The rescaled PDF P(y) has a mean of zero and an rms of 
one. For a wide range of models (see RB00 for examples) 
P(y) can be well fit by a log-normal distribution, that is 



where 



with 



Hn(v) 



x{y) 



C 



V2^42 



MB + Cy\A\/A) 
\A\ 



B = e A2 / 2 



C = \ / B i - B 2 



(5) 

(6) 

(7) 
(8) 



This distribution has one free parameter A, with the limit 
A = corresponding to Gaussianity. Following RGS98, we 
characterize each PDF in terms of a single non-Gaussianity 
parameter G, where 



G = 2tt 



I 3 °° p (y)dy 

f™e-y 2 / 2 dy' 



(9) 



G is the probability of obtaining a peak of height 3cr or 
higher for the PDF in question, relative to that for a 
Gaussian model. A value G > 1 indicates an excess of 
large positive fluctuations, G < 1 indicates a deficit, while 
G = 1 indicates Gaussian fluctuations. Since rich clusters 
typically form from 3er or higher peaks in the primordial 
fluctuations, this parameter is a useful quantifier of non- 
Gaussianity in this context. 

2.3. Background Cosmology 

The background cosmology is specified by two parame- 
ters, the fraction of critical density at z — contributed 
by dark matter (f2 m ) and the fraction contributed by a 
cosmological constant (Qa)- We restrict our investigation 
to the flat (fi m +fiA=l) an d open (J7a = 0) cases. 

2.4. Cluster Mass-Temperature relationship 

The Press-Schechter (PS) formalism (Press & Schechter 
1974) allows us to predict the number density of clusters 
in the universe as a function of cluster mass. This for- 
malism has been adapted to the non-Gaussian case, by 
Colafrancesco, Lucchin & Matarrese (1989), Chiu et al. 
(1997), and RGS98 and has been shown in RB00 to fit 
the cluster evolution observed in non-Gaussian models of 
structure formation to better than 25% accuracy. In par- 
ticular, the number density of clusters with masses be- 
tween M and M + dM is given by 



n(M)dM 



3/ 



4xR{MJ 



rP[y c (M)} 



d[yc(M)} 
dM 



dM. 



(10) 
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Here R(M) is the Lagrangian (pre-collapse) radius of a 
sphere giving rise to a cluster of mass M, which satisfies 



M 



An 



R{Mfp b 



(11) 



with pi, being the comoving background density of the uni- 
verse. Also 



y c (M) = 5 c /a M 



(12) 



where <jm is the rms mass fluctuation on scale M, linearly 
extrapolated to redshift z, and S c is the critical overdensity 
for collapse, which has the value 1.69 in a flat universe (for 
fits to the weak cosmological dependence, see Kitayama & 
Suto 1996). Finally, / is a correction factor, given by 



/ = 



1 



fo°°dyP(y) 



(13) 



which is included to ensure that the mass function ac- 
counts for the entire mass of the universe. In the Gaussian 
case / takes the value two, as originally proposed by Press 
& Schechter (1974) and explained by Bond et al. (1991) 
using the excursion set formalism. 

The Press-Schechter formalism allows us to accurately 
predict the cluster number density as a function of cluster 
mass. For most of the clusters considered in this work we 
observe not the mass but the temperature of the intra- 
cluster medium (via the spectrum of emitted X-rays). We 
therefore need to make use of a mass-temperature relation 
in order to compare our models with observations. Hy- 
drodynamical simulations of cluster formation (Navarro, 
Frenk & White 1995; Evrard, Metzler & Navarro 1996; 
Eke, Navarro & Frenk 1998) suggest that the relationship 
between mass and temperature can be well fitted by 



fc^ qas 



9.37 



M 



(3(5X + 3) VlO 15 /i- 1 M 



2/3 



(1- 



1/3 



1/3 



(14) 

where Ac is the ratio of the mean halo density to the crit- 
ical density at redshift z (for fitting functions in various 
cosmologies, see Kitayama & Suto 1996), X is the hydro- 
gen mass fraction (we take X = 0.76), and (3 is the ratio of 
the specific galaxy kinetic energy to the specific gas ther- 
mal energy. Numerical simulations suggest that the aver- 
age value for this quantity is in the range (3 = 1.0 — 1.3. 
For the bulk of this work we take (3 = 1.0, although we 
also investigate the effect of varying the value. Unfortu- 
nately, knowledge of the mean value of (3 is not enough 
to specify our models, as the mass temperature relation is 
also subject to scatter, with some clusters of a given mass 
being hotter and other clusters being colder. Simulations 
suggest that the spread of (3 values can be reasonably fit 
as a log- normal distribution, that is log 10 (/?) is Gaussianly 
distributed, with mean log 10 (/3) and variance up. Using 
the distribution of (3 values, we can infer the probability 
p(T\M)dT of a cluster of mass M having a temperature be- 
tween T and T+dT. The expected number density n(T)dT 
of clusters with temperature between T and T+dT is then 
given by 



Due to the fact that n(M) for typical models is a sharply 
falling function of M, increasing the scatter up system- 
atically boosts n(T). We adopt up = 0.065 as a fidu- 
cial choice, though we also investigate the effect of varia- 
tions about this value. The additional scatter in the mass- 
temperature relation caused by temperature measurement 
errors is typically smaller than the intrinsic scatter dis- 
cussed above, but we can account for it in the same way 
by adding temperature measurement errors in quadrature 
to a p. Obviously, our assumptions about the form of 
P(T\M) will affect the inferred likelihood of observing a 
cluster with a given temperature. We will see later that 
our results are relatively insensitive to the amplitude of 
the scatter in temperature values for a given mass, and 
consequently it is reasonable to assume that the particu- 
lar choice of a log-normal distribution for this scatter does 
not bias our results. 

To summarize, our models arc parameterized by 7 pa- 
rameters: T, eg, G, fl m , fl\, (3 and up. 

3. DATA 

We extract information on the redshift evolution of clus- 
ters from three sources: 

• Henry & Arnaud clusters: This is an X-ray selected 
sample of clusters with galactic latitude b > 20° and 
a flux limit / m i n = 3xl0~ n ergcm~ 2 s _1 in the 2—10 
keV band (Henry & Arnaud 1991). The sample con- 
tains 25 objects, with a median redshift z = 0.05, 
and is at least 90% complete (for the purposes of this 
work we shall assume 100% completeness, although 
it makes no difference to our conclusions). 

• Markevitch clusters: This is a sample of clusters se- 
lected from the ROSAT All-Sky Survey Abell cluster 
list (Ebeling et al. 1996) plus three known bright 
non-Abell clusters, all with b > 20° and cooling-flow 

k e y corrected flux greater than / m ; n = 2x 10~ n erg cm~ 2 
s" 1 in the 0.1 - 2.4 keV range (Markevitch 1998). In 
addition, the sample is volume limited by requiring 
Zmin < z < z max with z min = 0.04 and z max = 0.09. 
For an explanation of the cooling flow correction pro- 
cedure, see Markevitch (1998). We make use of both 
cooling flow corrected and uncorrected temperatures 
in this analysis. 

• EMSS clusters: The EMSS clusters are selected from 
an X-ray survey in the 0.3-3.5 keV band covering a 
total area A to t — 735 square degrees. The fraction of 
survey area A sensitive to fluxes larger than /dot can 
be fit to better than 10% accuracy (Eke et al. 1998) 
for fluxes greater than 2.5 x 10~ 13 erg cm' 2 s _1 by 



Mf, 



dctj 



A, 



1 - 3.05e-°- 41/dct + 2.30e- a77/dot . (16) 



n{T)dT = / p(T\M)n(M)dM 



(15) 



We consider a volume limited sample of 9 clusters 
with 0.3 < z < 0.4 whose temperatures have been 
measured by Henry using the ASCA satellite (Henry 
97). We also consider a sample comprising two 
objects with ASCA temperatures (MS0015.9+1609, 
z = 0.54, kT = 8.0 ± 0.6 keV, Mushotzky & Scharf 
1997; MS0451.6-0305, z = 0.54, kT = 10.4 ± 1.2 
keV, Donahue 1996), for which we take the redshift 
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limits to be 0.5 < z < 0.65. Finally, we consider an- 
other sample comprising one object (MS1054. 5-0321, 
z = 0.83, kT = 12.3±|"i keV, Donahue et al. 1998), 
with redshift limits taken to be 0.65 < z < 0.9. We 
will investigate whether this choice of redshift limits 
affects our results. 

We analyze each sample as follows: First, we model the 
relationship between luminosity (L) and temperature for 
each sample as a Gaussian distribution in log(L), with 
mean given by 

log 10 (W(10 44 erg s- 1 )) = A, log 10 (fcT/kcV) + A 2 (17) 

and a standard deviation 



sum runs over all clusters), using four different assump- 
tions about the LT relationship. For "Fiducial LT" we 
use the LT model and parameters discussed above. For 
"High LT" we multiply luminosities in the fiducial model 
by a factor of 10, and for "Low LT" we divide by a factor 
of 2. For "Actual Luminosities" we revert to the method 
for calculating V^, ' -■ ,! •■ ; ;il 
(1998a), that is 



for the i cluster used by Eke ct al. 



( 7 L =A 3 log 10 (fcT/kcV)+A 4 



(18) 



The subscript X denotes the observing band for the sam- 
ple in question, and fiducial values for the parameters A\, 
A2, A3 and A4 for each of the cluster samples are (see 
Eke et al. 1998): Markevitch A x = 2.1, A 2 = -1.485, 
A 3 = 0.0, A 4 = 0.104; Henry & Arnaud A x = 3.93, 
A 2 = -2.92, A 3 = -0.52, A 4 = 0.70; EMSS clusters 
A x = 3.54, A 2 = -2.85, A 3 = -0.47, A 4 = 0.69. Hav- 
ing assumed a luminosity-temperature (LT) relation, we 
compute the average volume V max (T) in which a cluster of 
temperature T could be seen for the sample in question: 



Knax(T) = / dL P (L\T) 



dz 



d 2 V 
duodz 



xui{L, z) $(L - 4:TTf mm d 2 L (z)) 



(19) 



wherep(L|T) is the probability distribution of L for a given 
value of T (defined above), d 2 V / (dudz) is the comoving 
volume element per unit redshift per solid angle, u)(L, z) 
is the solid angle associated with flux limit L at redshift 
z, dh is the luminosity distance, and 6(x) is the Heaviside 
function 

*<*) = { I :::*<S- w 

The LT relation for each sample is subject to various un- 
certainties, and at high redshifts the number of clusters for 
which luminosity and temperature have been observed is 
so low that any accurate determination is virtually impos- 
sible. For flux limited samples, uncertainties in p(L\T) will 
lead to uncertainties in V max , and therefore to uncertain- 
ties in the inferred number density of clusters. However, 
for surveys which are limited in both flux and volume, suf- 
ficiently hot clusters will typically be so bright that for all 
reasonable luminosities they could be seen throughout the 
entire survey volume, so the uncertainty in V max is much 
smaller. We investigate this effect by plotting the cumu- 
lative temperature function 



N >T = 



2.^ yi 



1 



(21) 



for the Markevitch and Henry samples discussed above 
(N > t is the number density of clusters with temperature 
greater than T, Ti and V^ ax are the temperature and max- 
imum volume for the ith cluster in the sample, and the 



V 1 (T) 

max V / 



dz 



d 2 V 
dwdz 



u(L i ,z)e(L i -4wf mi nd 2 L (z)), 

(22) 
where L % is the luminosity of the i th cluster. 

Results for these four cases in a critical density universe 
are shown in Figures 1 and 2. The Markevitch sample 
(Fig. 1), which is volume limited, shows virtually no de- 
pendence on the LT relationship for T > 6.3 keV, and 
the Henry sample (Fig. 2) is similarly robust for this 
temperature range. For the remainder of this work, we 
will concentrate our analysis on clusters with tempera- 
tures T > 6.3 keV for the low redshift samples, T > 8.0 
keV for the 0.5 < z < 0.65 sample, and T > 10 keV 
for the 0.65 < z < 0.9 sample. As discussed, restrict- 
ing our analysis to the highest temperature clusters re- 
duces the dependence of our results on uncertainties in 
the LT relation. As a consequence, we will not worry 
about the effect of X-corrections, whose typical effect on 
the luminosity is less that the uncertainties in the LT re- 
lation. An additional advantage of restricting our study 
to the highest temperature clusters is that heating of the 
intra-cluster medium by supernovae (which could affect 
the cluster mass-temperature relation) is likely to be least 
important in this regime, as discussed by Viana & Liddlc 
(1999). 
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Fig. 1. — Cumulative temperature function for the Marke- 
vitch (1998) cluster sample, assuming Q. m — 1, S7a = 0, and 
four different versions of the luminosity-temperature relation, 
as discussed in the text. 

We comment briefly on the adopted slope of the LT 
relation. For the Markevitch sample, we adopt a slope 
A\ = 2.1 as measured by Markevitch (1998) for the 0.1- 
2.4 keV observing band. This corresponds to a slope for 
the bolomctric luminosity vs temperature relation of 2.64. 
We follow Eke et al. (1998) in adopting a slope A x = 3.93 
for the Henry and Arnaud sample, and Ai = 3.54 for the 
EMSS clusters. Self-similar scaling arguments on the other 
hand would imply a slope A\ = 2 (Kaiser 1986), while a 
different study of the low redshift LT relation (Arnaud 
& Evrard 1999) finds a slope of 2.88. As discussed, our 
analysis is not strongly affected by uncertainties in the LT 
relation, and variations of the slope within the range sug- 
gested by the observations do not significantly alter our 
conclusions. 



Fig. 2. — Cumulative temperature function for the Henry 
(1997) cluster sample, assuming fl m — 1, J7a = 0, and four dif- 
ferent versions of the luminosity-temperature relation, as dis- 
cussed in the text. 

From the above discussion we anticipate that for some 
range of temperatures T m j n < T < T mllXl the maximum 
volume V max will be robust, depending only weakly on un- 
certainties in the LT relationship. Exploiting this fact, we 
can make use of a simple expression for the likelihood func- 
tion: assuming that the clusters are Poisson distributed, 
the likelihood C of observing a sample of iV c clusters given 
a temperature function n(T) is 



»=1...JV 



[] (i - ncrowr,-)) 

j=l...JV b -JV 

where the temperature range has been divided into a set 
of iVb bins, the sum i runs over all bins containing a clus- 
ter, the sum j runs over all bins not containing a cluster, 
and the bins are sufficiently small that the probability of 
finding more than one cluster in a bin is negligible. In the 
limit that the bin size tends to zero, ln£ satisfies (up to 
an additive constant Cq) 



ln£ = V In (n(T t )V max (T t ))- / n(T)V max (T)dT+C 

JT min 

(24) 

where the index i runs over all clusters in the sample satis- 
fying T m i n < Ti < T max . Provided V max is well determined 
for the range of temperatures we will consider, this expres- 
sion gives a robust determination of the likelihood. Eke 
et al. (1998) have used a very similar expression for the 
likelihood which also takes account of individual redshifts 
within a cluster sample. However, since the n(T) laws 
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do not change too dramatically within the redshift ranges 
probed by each of our samples, we choose instead to work 
just with n(T) for the median redshift of each sample. 



4. RESULTS 

4.1. Current Data 

We now consider in detail the constraints we can place 
on Q m and non-Gaussianity using current observational 
data. As we will see later, the results do not depend 
strongly on whether the universe is flat or open, so for 
the time being we will concentrate on the case of an open 
universe with zero cosmological constant. To start our dis- 
cussion, we consider the constraints placed by just two of 
the datasets: the Markevitch data at low redshift, and the 
0.65 < z < 0.9 data containing MS1054.5-0321 at high 
redshift. Figure 3 shows confidence limits derived by com- 
bining these two datasets: in the Q m — as plane assuming 
an open universe with G = 1 (left panel), in the G — a% 
plane assuming f2 m — 1 (central panel) and in the G — o% 
plane assuming Q m — 0.3 (right panel). The solid lines 
show 90% confidence limits for the Markevitch data, the 
dotted lines show 90% limits for the 0.65 < z < 0.9 data, 
and the dark and light shaded regions show 68% and 90% 
confidence regions for the combined data sets. Confidence 
limits are computed assuming uniform priors in all vari- 
ables. Under the assumption of Gaussianity, the combined 
datasets prefer Ct m ~ 0.3 with fl m = 1 excluded at more 
than the 2a level, though we will see in a moment that var- 
ious systematic uncertainties might affect this conclusion. 
A critical universe is consistent with the two datasets if 
the fluctuations arc non-Gaussian with G > 2.5. 
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Fig. 3. — Confidence limits from the Markevitch data (solid 
lines, 90% limits), the 0.65 < z < 0.9 data (dotted lines, 90% 
limits), and for the combined data sets (dark shaded region, 
68% limits; light shaded region, 95% limits), for the cases G = 1 
(left panel), Q m = 1 (central panel) and fl m = 0.3 (right panel). 
The vertical long-dashed lines in the center and right panels 
show the location of Gaussian fluctuations. 

The discussion above, which uses just two of our 
datasets, illustrates the basic degeneracy (in the Q m — a$ 
or G — as plane) intrinsic to cluster observations at a single 
redshift, and how this degeneracy is broken using obser- 
vations of cluster evolution. Before embarking on a full 
likelihood analysis incorporating all of the datasets, we 
will check the consistency of the various datasets and in- 
vestigate various systematic effects which might alter our 
conclusions. 
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Fig. 4. — Likelihood (marginalized over ag) for the 0.65 < z < 
0.9 data combined with different low redshift data, as labeled. 
The dotted horizontal lines in this and the following figures 
show likelihood thresholds corresponding to la (top line) and 
2(j (bottom line) confidence limits, in the approximation that 
the likelihood function is Gaussian. 

First, we consider the effect of using different low red- 
shift data. Figure 4 shows the likelihood (marginalized 
over us assuming a uniform prior) for four different choices 
of low redshift data, for the cases G = 1, il m = 1, and 
fl m = 0.3. The low redshift data considered are the Marke- 
vitch sample, the Markevitch sample with temperatures 
not corrected for cooling flows, the Henry and Arnaud 
sample, and the Henry and Arnaud sample with a red- 
shift limit of z < 0.09 imposed. The likelihood functions 
are almost identical (in particular, the temperature correc- 
tions in the Markevitch sample make very little difference) , 
with the exception of the Henry and Arnaud data without 
a volume limit. The difference in this case arises because 
the hottest clusters in the sample are luminous enough to 
be visible at redshifts larger than z = 0.09. For instance, 
the hottest cluster in the sample (A2142) is sufficiently 
luminous to be detected out to a redshift of 0.014, imply- 
ing y max = 1.6 x 10 8 /i~ 3 Mpc 3 , while the imposition of the 
redshift limit reduces V max to 4.6 x 10 7 /i~ 3 Mpc 3 . Simi- 
lar changes in L max apply to the two next most luminous 
clusters, implying that the three brightest clusters are all 
found within the closest 1/3 of the volume in which they 
could be detected. The fact that no clusters are actually 
observed at z > 0.09 in this sample is either a chance oc- 
currence (the event has moderate statistical significance) 
or suggests that either the luminosity-temperature relation 
or the number abundance of clusters evolves significantly 
beyond z = 0.09. The latter two effects would invalidate 
our analysis, since we do not take into account any evo- 
lution within a single redshift bin. However, this prob- 
lem can be eliminated by making sure the redshift bin is 
sufficiently small, for instance by limiting the maximum 
redshift. The imposition of a volume limit does indeed 
yield results which arc in extremely close agreement with 
those from the Markevitch sample, and we make use of 
the Markevitch data to normalize the low redshift cluster 
abundance for most of the remaining discussion. Later on 
we will check whether using the Henry & Arnaud or the 
Markevitch sample makes a significant difference to our 
final results. 
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Fig. 5. — Likelihood (marginalized over as) for the Markevitch 
data, combined with different high redshift data, as labeled. 

Second, we consider the effect of using different high 
redshift datasets. Figure 5 shows the likelihood (marginal- 
ized over eg ) for three different choices of high redshift 
data, for the cases G = 1, fL„ = 1, and fl m = 0.3. The 
Henry data contains the most clusters and is the best un- 
derstood systematically, as it is complete to a known flux 
limit. However, even the most extreme models are not 
expected to show very much evolution at this moderate 
redshift, and very little of the available parameter space 
is ruled out at the 2<r level. The constraints from the 
two clusters in the interval 0.5 < z < 0.65 are somewhat 
stronger, with Q m = 1 with Gaussian fluctuations being 
inconsistent at the 2tr level. However, the greatest statisti- 
cal weight comes from the highest redshift sample, despite 
the fact that it contains only one cluster. (For most of 
the remaining discussion we consider a model to be in- 
consistent at the 2cr level if ln(L/L max ) < — 2, and at the 
Id level if ln(L/L max ) < —0.5, which are the thresholds 
which would apply if the likelihood function were Gaussian 
- most of our likelihood functions are close to Gaussian at 
their peak, so this approximation should be reasonable. 
These thresholds are shown by horizontal lines in our like- 
lihood plots). 

Next we investigate the effect of uncertainties in the 
LT relation on the allowed ranges of Cl m and G. Figure 
6 shows the likelihood for a variety of choices of LT re- 
lationship at low and high redshifts. As noted, we have 
attempted to restrict our analysis to temperatures which 
are sufficiently high that volume limits are more impor- 
tant than flux limits, thus minimizing the importance of 
the LT relation. We see from the figures that changing 
the LT relation at high redshift has relatively little effect 
on the likelihood function. The only change that makes a 
significant difference is reducing the amplitude of the LT 
relationship at low redshift. Such a change would imply 
that the true number density of clusters of a given tem- 
perature is in fact higher, increasing the inferred level of 
cluster evolution and increasing the preferred value of fl m 
(or reducing the preferred value of G in the non-Gaussian 
case). However, even for the extreme case considered here, 
where the mean luminosity is reduced by a factor of 2, the 
changes to the likelihood function are not large. 



Fig. 6. — Likelihood (marginalized over as) for the Markevitch 
data and the 0.65 < z < 0.9 data under different assumptions 
about the cluster luminosity-temperature relationship, as dis- 
cussed in the text. 
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Fig. 7. — Likelihood (marginalized over as) for the Markevitch 
data and the 0.65 < z < 0.9 data, under different assumptions 
about the temperature of MS1054. 5-0321. The dotted horizon- 
tal lines show likelihood thresholds corresponding to la (top 
line) and 2a (bottom line) confidence limits. 

Now we investigate the effect of uncertainties in the mea- 
sured temperature of the high redshift cluster MS1054.5- 
0321. In Figure 7 we show the likelihood function 
marginalized over og for the cases G = 1, Q m = 1, 
and Q m — 0.3 under four different assumptions about 
the true temperature of the cluster (T = 10.0 keV, 
T = 12.0 keV, T = 14.7 keV and T = 16.0 keV). 
Assuming an M < r cx R 06i cluster profile, the corre- 
sponding masses within a 1.5Mpc comoving radius are 
0.62 x KF/T^M©, 0.76 x 1O 15 /i _1 M , 0.96 x 10 15 /i- 1 M Q 
and 1.06 x 10 15 h~ 1 M & respectively if fl m — 1, and 
0.70 x 10 15 /i- 1 M Q , 0.86 x W^H^Mq, 1.09 x lO^/T^M© 
and 1.20 x 1O 15 /i _1 M respectively if Q, m = 0.3. These 
possibilities span the range allowed by other estimates of 
the mass of the cluster from weak lensing and velocity dis- 
persion observations (Bahcall & Fan 1998a). For lower 
cluster temperatures, the preferred value of f2 m is higher 
(or in the non-Gaussian case the preferred value of G is 
higher). However, even for the lowest value of T consid- 
ered, an Q m = 1 universe with Gaussian fluctuations is 
inconsistent at the 2a level, with the best fit value of fl 
(or G in the non-Gaussian case) almost unchanged. 
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Next we consider the effect of varying the range of tem- 
peratures r m ; n < T < T max over which we perform our 
likelihood analysis for the high rcdshift sample. In Figure 
8 we show the likelihood function marginalized over cr 8 for 
the cases G = 1, fl m = 1, and fl m — 0.3 for T m ; n = 6.3 
keV, T min = 10.0 kcV, and T min = 12.0 kcV. For lower 
values of T m j n it becomes increasingly more likely that the 
0.65 < z < 0.9 sample is incomplete, as there are fainter 
clusters in this redshift range whose temperatures have 
not yet been measured. Our analysis assuming T min = 6.3 
keV probably underestimates the true number of clusters 
in this temperature range (which it takes to be one), and 
therefore over-estimates the degree of cluster evolution, 
consequently over-estimating Q m (or under-estimating G 
in the Gaussian case). Our analysis assuming T min = 12 
keV is less likely to be subject to incompleteness. However, 
in the case that the true temperature of MS1054. 5-0321 
turned out to be less than 12 kcV, our sample would have 
over-estimated the number of clusters within the temper- 
ature range (as one instead of zero), making the analysis 
invalid. The case T m ; n = 10.0 keV lies in the middle, and 
the two cases just discussed can be considered to repre- 
sent a wide range of systematic error from incompleteness 
of the high redshift sample. In the Gaussian case, the 
best fit value of fL n is in the range Vt m = 0.3 ± 0.15, but 
no choice of T m i n is able to reconcile a Gaussian Q m = 1 
universe with the data at the 2a level. 
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Fig. 9. — Likelihood (marginalized over as) for the Marke- 
vitch data and the 0.65 < z < 0.9 data, for different versions 
of the mass-temperature relationship. 
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Fig. 8. — Likelihood (marginalized over as) for the Markevitch 
data and the 0.65 < z < 0.9 data, for different values of the 
parameter T m i n . The dotted horizontal lines show likelihood 
thresholds corresponding to la (top line) and 2a (bottom line) 
confidence limits. 

Next we consider the effect of changes in the mass tem- 
perature relationship on our likelihood analysis. Figure 9 
shows the likelihood function marginalized over as for the 
cases G = 1, £l m = 1, arid il m = 0.3 for different values of 
the parameters (3 and ap. The maximum likelihood val- 
ues and exclusion limits are not significantly changed by 
any of these modifications. Also, we consider the effect of 
varying the power spectrum of the fluctuations. Figure 10 
shows the likelihood function marginalized over a% for the 
cases G = 1, Vt m = 1, and Q, m = 0.3 and a CDM shape 
parameter T in the range 0.05 < Y < 0.5, wide enough 
to encompass any viable model. Exclusion limits are not 
significantly altered by variations of T in this range. 



Fig. 10. — Likelihood (marginalized over as) for the Marke- 
vitch data and the 0.65 < z < 0.9 data, for different values of 
the power spectrum shape parameter I\ 



4.2. Tests of Methodology 

Now we consider the effect of possible systematic er- 
rors in the modified Press-Schcchtcr prediction for non- 
Gaussian models. In RB00 it has been shown that the 
Press-Schcchter formula can model the number abun- 
dance of clusters observed in N-body simulations with 
non-Gaussian initial conditions to better than 25% accu- 
racy. We test the effect of this degree of systematic un- 
certainty by multiplying the low and high redshift predic- 
tions of the cluster number abundance by factors F\ and 
F-i respectively. Figure 11 shows the likelihood function 
marginalized over as for the standard choice F\ — F% = 1, 
a model with enhanced evolution for which F\ = 1.25 and 
F2 = 0.75, and a model with less evolution, for which 
F\ = 0.75 and Fi — 1.25. Increasing the degree of evo- 
lution slightly increases the preferred value of f2 m , or re- 
duces the preferred value of G in the non Gaussian case, 
and decreasing the degree of evolution has a small effect 
in the opposite direction. Even in this extreme case how- 
ever, where we have changed the relative number densities 
at low and high redshift by a factor of more than 50%, 
considerably more than the uncertainty suggested by the 
simulations of RB00, there is little effect on the results of 
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our likelihood analysis. 



the likelihood function marginalized over as for the cases 
G = 1, £l m = 0.3, and Q m = 0.6 for open and flat uni- 
verses. The lambda case typically predicts slightly more 
evolution than the open case if the other parameters are 
held fixed. Consequently, the best fit values of Vt m are 
slightly lower, or in the non-Gaussian case, the best fit 
value of G is slightly higher. 
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Fig. 11. — Likelihood (marginalized over as) for the Marke- 
vitch data and the 0.65 < z < 0.9 data, for different values 
of the parameters Fi and F2, included to account for possi- 
ble systematic errors in the Press-Schechter prediction for non- 
Gaussian models. 

One additional source of systematic error is the possibil- 
ity that the Press-Schechter formalism could break down in 
the regime of very rare peaks. In RBOO the non-Gaussian 
Press-Schechter formalism has been tested in 100ft. -1 Mpc 
simulations, where limited volume restricts the possibility 
of testing the density of the rarest clusters. For the case of 
Gaussian fluctuations, Governato et al. (1999) have car- 
ried out very large simulations (containing 4 x 10 7 parti- 
cles) and suggest that the PS formalism may under-predict 
the density of the rarest peaks by a factor of about 10. 
Their observed halo abundance can be well fit by a mod- 
ified PS formula where the critical overdensity 5 C varies 
with redshift and fluctuation amplitude via 



5 r = 1.685 



0.7(1 + z) 



(Ja 



(25) 



In the central panel of Figure 11 we show the result for 
the likelihood function derived using this modified form 
for 5 C (long-dashed line). The resulting likelihood func- 
tion differs little from that obtained using the standard 
PS formalism, and in particular the case of Gaussian fluc- 
tuations with £l m = 1 is still ruled out at the 2a level. 
From this discussion we infer that uncertainties about the 
validity of the PS formalism in the rare peak limit should 
not have a large effect on the conclusions of this paper. 

Finally, we show the effect of assuming a flat universe 
with a cosmological constant, rather than the open case 
which we have considered up to now. Figure 12 shows 



4.3. Combined results 

Having considered the importance of various sources 
of systematic error, we now show the likelihood function 
derived by combining all of the cluster data at different 
redshifts. In Figure 13 we show the likelihood function 
(marginalized over erg) for the cases G = 1, H m = 1.0 
and £l m — 0.3, using the combination of the Markevitch, 
Henry, 0.5 < z < 0.65 and 0.65 < z < 0.9 datasets. We 
show the results for open and flat universes together with 
two additional analyses for the open case. The first, de- 
noted "Low $1", incorporates the major systematic uncer- 
tainties favoring a low value of fl m . The second, denoted 
"High 51" , incorporates the major systematic uncertain- 
ties favoring a high value of £l m . For the "Low il" model, 
we replace the Markevitch data at low redshift with the 
Henry and Arnaud sample, without a volume limit, and we 
adopt a lower temperature bound of T m i n = 12.0 keV for 
our analysis of the 0.65 < z < 0.9 sample. For the "High 
$1" model, we multiply the Press-Schechter prediction for 
the low redshift sample by a factor F\ — 0.75, and the pre- 
dictions for the 0.5 < z < 0.65 and 0.65 < z < 0.9 samples 
by a factor F2 = 1.25. We also adopt a normalization for 
the mass-temperature relationship of (3 = 1.3, use a tem- 
perature for MS1054.5-0321 of lO.lkeV, and adopt a lower 
temperature bound of T m i n = 6.3 for the analysis of the 
0.65 < z < 0.9 sample. The results from the "High $1" and 
"Low Q" analyses span a reasonable range of systematic 
uncertainty in the true likelihood function. 
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Fig. 12. — Likelihood (marginalized over as) for the Marke- 
vitch data and the 0.65 < z < 0.9 data, for open and flat 



universes. 



For interpreting most of the results in this paper, we 
have made the simplifying assumption that the likelihood 
levels corresponding to a given confidence limit are just 
the levels that would be appropriate for a Gaussian distri- 
bution. In the Gaussian case the la limit is ln(L/L max ) > 
—0.5 and the 2a limit is ln(L/L max ) > —2. Clearly some 
of our likelihood functions are not well approximated as a 
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Gaussian distribution, and for this final analysis we quote 
precise limits for each likelihood curve, assuming uniform 
priors in O m and log 10 (G), with the parameters restricted 
to lie in the range plotted. We find that la thresholds are 
within the range —0.57 < ln(L/L max ) < —0.48 and the 
2cr thresholds are within the range —2.07 < ln(L/L max ) < 
— 1.93 (that is, very close to the Gaussian levels) for al- 
most all the curves in Figure 13. The only exceptions are 
the Open and "Low Omega" models in the il m = 0.3 case, 
for which the la limit is L/L max > —0.23 and the la limit 
is L/L max > —1.1, and the "High Omega" model in the 
£l m = 0.3 case, for which the la limit is L/L max > —0.23, 
and the la limit is L/L max > —1.75. 

To summarize the results of the full likelihood analysis, 
we find that for a Gaussian universe, the best fit value of 
Vt m is in the range tt m ~ 0.4 ± 0.25, with Q m = 1 incon- 
sistent at the 2d level independent of almost all sources of 
systematic error. The Gaussian, f2 TO = 1 case can only be 
reconciled at the 2a level by a conspiracy of several sys- 
tematic effects working in the right direction. If we assume 
that Vt m — 1, then the predicted degree of cluster evolution 
can be reconciled with the observations if G > 2.0, with 
the best fit value being G ~ 6.5 ± 2.0. Under the assump- 
tion that Q m — 0.3, Gaussianity is always consistent with 
the data, but a wide range of non-Gaussian models also 
fit, with all values G < 4 (or G < 6 in the lambda case) 
acceptable at the 2a level. These conclusions are robust to 
a wide range of systematic uncertainties, and independent 
of whether the universe is open or flat. 
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of non-Gaussianity. In Figure 15, we show the relationship 
between skewness S, kurtosis K, and non-Gaussianity pa- 
rameter G for the log normal family of PDFs we are con- 
sidering. Since the PDF is normalized to have mean zero 
and standard deviation one, the skewness is just 



y 3 P(y) dy. 



while the kurtosis is 



/oo 
y 4 P(y) dy - 3. 
-OO 



(26) 



(27) 



The G — 2.5 case has a skewness of 0.3, and a kurtosis of 
0.1, while the G — 9.0 case has a skewness of 1.2 and a 
kurtosis of 2.6. Some typical G values for physical mod- 
els are: cosmic strings - G ~ 5 (RB00), cosmic textures 
- G ~ 14 (Park, Spergel & Turok 1991), Peebles ICDM 
G ~ 15 (RB00). The latter two models would therefore 
appear to be ruled out if the matter density of the universe 
is of order 0.3. 




Fig. 13. — Likelihood (marginalized over as) for all datasets 
combined, for an open universe, a lambda universe, and for 
"Low f2" and "High f2" analyses which incorporate a number 
of systematic effects favoring low and high values of f2 m , re- 
spectively. 

We discuss briefly the degree of non-Gaussianity which 
these G values represent. Figure 14 shows probability dis- 
tribution functions with G = 1, G = 2.5 and G = 9. We 
see that the Gaussian and the G = 2.5 cases are barely dis- 
tinguishable, with the differences only becoming apparent 
for the rare tail of fluctuations. We reiterate that the level 
of non-Gaussianity represented by the G = 2.5 PDF is suf- 
ficient to reconcile an Q m = 1 universe with the observed 
degree of cluster evolution. The G = 9 PDF differs from 
the Gaussian case more obviously over its entire range, not 
just for the rare tail of perturbations. We can also relate 
the parameter G to some more familiar characterizations 



Fig. 14. — One Gaussian and two non-Gaussian PDFs, 
non-Gaussianity parameters as labeled. 



with 



From this discussion, we see that the level of non- 
Gaussianity required to reconcile an Q m = 1 universe with 
existing cluster evolution data is relatively small, and less 
than the amount predicted by many well motivated non- 
Gaussian models. One promising scenario which could 
'save' Q m — 1 is a special form of "hybrid inflation" , where 
an initial spectrum of adiabatic Gaussian fluctuations is 
further perturbed by the evolution of a network of cosmic 
defects (Contaldi, Hindmarsh & Magueijo 1999; Battye 
& Weller 1998). The level of non-Gaussianity in such a 
model would be lower than that resulting from the action 
of the defect network alone. Of course, for this model to 
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work it would also have to overcome a number of indepen- and it may be necessary to consider the full form of the 

dent arguments against a critical density universe (see e.g. distribution. 

Bahcall & Fan 1998b). 
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Fig. 15. — Relationship between skewness and non- 
Gaussianity parameter G for the log-normal family of PDFs. 

As one last point, we comment on our particular choice 
of the parameter G as a quantifier of non-Gaussianity. Let 
us consider a more general non-Gaussianity statistic, G„, 
defined as 

J 00 P{y)dy 
Gn = 2lT T^ — ,,i< 2 . ■ ( 28 ) 

G n is the probability of obtaining a peak of height no or 
higher, relative to that for a Gaussian distribution. For 
the bulk of the discussion in this paper we make use of 
the parameter G = G3. Figure 16 shows the parameters 
Gi, G2, G3, G4 and G5 as a function of G, for the case 
of a log-normal PDF. Although the plot has been pro- 
duced using a log-normal distribution, an almost identical 
plot would result for a \ 2 PDF. We have found that log- 
normal and x 2 distributions with the same value of G3 
give very similar predictions for the cluster abundance, at 
least over the range of masses and redshifts discussed here. 
Since log-normal and \ 2 PDFs with the same value of G 
have very similar values of G2, G4 and G5, any of these 
parameters would also be an acceptable choice for param- 
eterizing non-Gaussianity. We concentrate on G3 since the 
collapse threshold of a typical cluster in the local universe 
is about 3cr, so that the value of G3 has a simple physical 
interpretation. By contrast, the parameter G\ would not 
be a useful quantifier of non-Gaussianity, since its value 
is almost independent of G. In extreme cases where the 
PDF is not accurately fit by a log-normal or \ 2 distribu- 
tion, the use of the single parameter G may be inadequate, 



Fig. 16. — Relationship between various non-Gaussianity 
parameters G n (see equation |28|) and our preferred non- 
Gaussianity parameter G, for a log-normal PDF. Results for 
a x 2 PDF are almost identical. 



4.4. Future data 

We have seen that existing cluster evolution data 
strongly disfavors an fl m = 1 universe with Gaussian fluc- 
tuations, preferring either a low value of f2 m , or a high 
value of G. An independent measure of Jl m (from CMB 
data and supernovae observations for example) would 
therefore allow us to draw conclusions about the degree 
of non-Gaussianity in the universe. Let us consider three 
different models which are roughly degenerate with respect 
to current cluster data: a model with il m = 1.0, G = 8.0, 
as = 0.35, a model with £l m = 0.6, G = 4.5, cr s = 0.525, 
and a model with fi m = 0.3, G = 1.0, erg = 0.85. The 
likelihood function of these models relative to the obser- 
vational data considered above is shown in Figure 17. To 
evaluate these likelihood functions, we have assumed that 
fl m is measured to within 0.05 for the £l m < 1 cases, and 
then marginalized over this uncertainty. For the O m = 1.0 
model, assuming that we have an unambiguous determi- 
nation of the true value of il m , the current cluster data 
gives a clear detection of the non-Gaussianity of the fluc- 
tuations, as discussed above. For the O m = 0.6 model, the 
current observational data is unable to detect the non- 
Gaussianity, with Gaussian fluctuations allowed at the 2a 
level. For the fi m = 0.3 case, Gaussian fluctuations are 
consistent with the data, with the non-Gaussianity pa- 
rameter constrained to be G < 4.0 at the 2a level. 
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non-Gaussianity would be possible at the la level. In the 
n m = 0.3 case, the allowed range of G is not significantly 
reduced, with the limit still being G < 4 at the 2a level. 
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Fig. 17. — Likelihood (marginalized over as) for the three 
models discussed in section 4.4 and the combination of the 
Markevitch and 0.65 < z < 0.9 datasets. The models are la- 
beled by the value of fi m , and analyzed assuming that the true 
value of Q m has been measured precisely in each case. 

For each of the models just mentioned, we have also 
computed likely confidence limits which could be achieved 
by a considerable but realistic increase in the volume of 
cluster evolution data. For each model, we have gener- 
ated mock realizations of the Markevitch catalogue, and of 
a complete EMSS catalogue, assuming that temperatures 
are measured for all clusters within the EMSS survey for 
which T > 6.3keV and z < 0.9. To generate these cata- 
logues, we split the surveys into a number of redshift slices 
(one slice for the Markevitch data, with the same redshift 
and flux limits and the same sky coverage, and four slices 
for the EMSS data with 0.3 < z < 0.4, 0.4 < z < 0.5, 
0.5 < z < 0.65, and 0.65 < z < 0.9, assuming the sky cov- 
erage and flux limits of the EMSS survey). For each slice, 
we compute the mean volume surveyed for each cluster 
temperature, assuming the fiducial LT relationships given 
above. We then generate a Poisson sample of clusters sat- 
isfying the n(T) law for the model in question, computed 
at the median redshift of the sample. 

We now assume that the true value of Q m has been 
measured by some independent method (to plus or minus 
0.05, except for the tt m = 1 case, where we assume an ex- 
act determination) and ask how well the mock catalogues 
are able to constrain the non-Gaussianity parameter G. 
Results for the likelihood function (marginalized over eg, 
which is unknown, and the uncertainty in Q m ) are shown 
in Figure 18. In the fi m = 1 case the improved data con- 
siderably reduces the errors on the determination of G, 
and in the Q m = 0.6 case an unambiguous detection of 



Fig. 18. — Mock likelihood functions ( mar ginalized over as) 
for the three models discussed in section 4.4 and the combina- 
tion of the Markevitch and EMSS mock datasets. The models 
are labeled by the value of Q m , and analyzed assuming that 
the true value of fl m has been measured (plus or minus errors 
of 0.05 in the Q m < 1 cases). 

From the above discussion, we see that even the con- 
siderable improvement in the quantity of cluster data dis- 
cussed above would not significantly improve constraints 
on non-Gaussianity if fi m is really of order 0.3. The reason 
cluster evolution does not give us such strong constraints 
in this case is that in a low density universe, clusters are 
not such rare events. For this reason, they do not probe 
the high-cr tail of the PDF, which is where Gaussian and 
non-Gaussian PDFs tend to differ most significantly. This 
fact is illustrated in Figure 19, where we show the cu- 
mulative number density of clusters expected at redshifts 
z = 0.05, z — 0.8, z — 1.5 and z = 2.0 for two models of 
structure formation (Gaussian - G — 1.0, as — 0.85, and 
non-Gaussian - G = 3.0, as — 0.75), both with Q m = 0.3, 
which are nearly degenerate with respect to the real and 
mock data discussed above. We see that the degree of 
evolution between z — 0.05 and z = 0.8 begins to differ 
considerably between the two models only for cluster tem- 
peratures larger than about 20 keV. However, clusters this 
hot are so rare (N > t < 10~ 10 /i 3 Mpc -3 ) that the surveys 
considered above do not probe enough volume to place any 
constraints on the number density. As we move to higher 
redshifts, lower temperature clusters become rarer, many- 
a events, and the degree of evolution to z — 1.5 differs 
considerably between the two models for cluster temper- 
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atures of order 15 keV. For the non-Gaussian model dis- 
cussed above, the number density of such clusters at red- 
shift z = 1.5 is iV>i5keV — 10~ 9 /i 3 Mpc -3 , meaning that a 
survey sensitive to clusters in the range 1.25 < z < 1.75 
would have to cover approximately 1000 square degrees to 
find an average of one T > 15 keV cluster. The existence 
of such a cluster would however be ten times less likely 
in a Gaussian universe, where A r > i5k c v — 10~ 10 /i 3 Mpc -3 . 
We conclude, therefore, that if the matter density of the 
universe does indeed turn out to be of order z = 0.3, much 
deeper cluster data will be required to improve constraints 
on non-Gaussianity, with surveys covering 1000 or more 
square degrees capable of detecting hot clusters (z > 15 
keV) at redshifts z > 1.5. A catalogue based on serendip- 
itous cluster detections from the forthcoming XMM satel- 
lite could hope to cover such an area to sufficient depth 
(Romer 1998). 




log 10 (kT/keV) 



Fig. 19. — Predicted cluster abundances at z — 0.05, z — 0.8, 
2 = 1.5, 2 = 2 (highest to lowest curves, respectively), for a 
Gaussian and a non-Gaussian model, as labeled, with f2 m =0.3. 



We make one final point on the degree to which future 
data will be able to constrain non-Gaussianity. From the 
above discussion, it is clear that the ability to detect non- 
Gaussianity depends on the detection of clusters arising 
from rare peaks in the primordial distribution. In addition 
to carrying out deeper observations, it will obviously be 
important to extend theoretical tests of the non-Gaussian 
Press-Schechter formalism to larger volumes, and to inves- 
tigate in detail any deviations from the PS formulae which 
exist in the rare peak limit. 



5. CONCLUSIONS 



We have made use of a modified version of the Press- 
Schechter formalism to examine the constraints on non- 
Gaussianity which can be derived from observations of 
evolution of the galaxy cluster number abundance. Our re- 
sults are summarized in Figure 13, which shows likelihood 
functions for the combined cluster datasets for the cases 
G = 1 (Gaussian fluctuations), Sl m = 1, and Vt m = 0.3. 
For an open universe with Gaussian fluctuations, the max- 
imum likelihood value of the matter density on the basis 
of current observations is fi m = 0.4 ± 0.25, with the case 
Q m = 1 inconsistent with the data at the 2a level. If we 
assume an VL m = 1 universe with non-Gaussian fluctua- 
tions, the best fit value of the non-Gaussianity parameter 
is in the range G = 6.5 ± 2, with G > 2.0 required for con- 
sistency with the data at the 2a level. The degree of non- 
Gaussianity required to 'save' the critical density universe 
is therefore relatively small, and could easily be realized by 
physically motivated models such as the combination adi- 
abatic and defect seeded perturbations predicted by some 
Hybrid inflation models. If we assume an f2 m = 0.3 uni- 
verse, the non-Gaussianity parameter is constrained to be 
G < 4 (67 < 6 in the lambda case) at the 2a level, implying 
that some well motivated non-Gaussian models, such as 
textures (G ~ 14) and Peebles ICDM (G ~ 15) would be 
inconsistent with the data. These conclusions are robust 
to a wide range of sources of systematic uncertainty. An 
fl m = 1 universe with Gaussian fluctuations could only be 
reconciled with the data if a conspiracy of several system- 
atic errors were all to modify our conclusions in the right 
direction. In particular, our results are unaffected by pos- 
sible systematic errors in the Press-Schechter prediction 
for the cluster abundance in non-Gaussian models, which 
RB00 has shown to be less than 25%. Our conclusions are 
also largely independent of whether the universe is open 
or flat (slightly lower values of O m or higher values of G 
are preferred in the cosmological constant case) . Since the 
non-Gaussian Press-Schechter formalism requires only the 
probability distribution function of primordial fluctuations 
as input, our analysis is independent of any uncertain fea- 
tures of the non-Gaussian physics in specific models. One 
source of uncertainty which does deserve further attention 
is the possibility that the PS formalism could break down 
in the very rare halo limit, as suggested by recent very 
large simulations (for example Governato et al. 1999). Our 
preliminary analysis in section suggests that this effect 
should not be too important for the range of cluster masses 
and redshifts discussed here, but this conclusion should be 
tested using larger simulations in the non-Gaussian case. 

The techniques discussed here allow us to constrain pri- 
mordial non-Gaussianity in the universe, provided we have 
an independent measurement of the matter density il m . 
We can realistically expect to gain such a measurement 
with high precision from the combination of upcoming 
CMB and supernovae data. If the matter density is mea- 
sured to be Q m = 1 then the current cluster evolution 
data represents a detection of non-Gaussianity. If the mat- 
ter density is measured to be f2 m = 0.6 or lower then 
both Gaussian and non-Gaussian fluctuations are consis- 
tent with current cluster evolution data. 

Our results for the Gaussian case agree well with those 
of Bahcall & Fan (1998a) who find n m = 0.2±gf and Eke 
et al. (1998) who find f2 TO = 0.45 ± 0.2, each using similar 
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data to that considered here. Our results disagree how- 
ever with those of Blanchard & Bartlett (1998), who find 
evidence in favor of an f2 m = 1 universe by combining the 
EMSS and ROSAT samples. As for non-Gaussian mod- 
els, our results are consistent with those of Willick (1998) 
in that both studies allow for a significant degree of non- 
Gaussianity if fi r „ ~ 0.3. Our analysis however does not 
significantly disfavor Gaussianity in this case. This dif- 
ference is probably due to the fact that we include more 
clusters and allow for more sources of systematic error, 
particularly in the low-redshift normalization. Our results 
are consistent with those of RGS98 and Koyama et al. 
(1999) provided that the matter density of the universe is 
low. In particular, RGS98 found that a significant amount 
of non-Gaussianity (G ~ 4.0l 2 !o) was required to recon- 
cile cluster abundance and cluster correlation data in the 
£2 m = 0.3 case. We caution the reader however that the ar- 
guments used in these last two papers depend on a model 
for halo correlations in non-Gaussian models whose agree- 
ment with N-body simulations is currently uncertain. On 
the other hand, the arguments used in the current work, 
and that of Willick (1998), are based on a model for halo 
abundance in non-Gaussian models which has been shown 
to agree well with that observed in simulations. 

We have also investigated the improvement to con- 
straints on non-Gaussianity which we could expect from 
a realistic increase in the quantity of data. If the mat- 
ter density of the universe is measured to be high (Q m 



greater than of order 0.6), then a moderate increase in the 
amount of cluster evolution data will allow a definite de- 
tection of non-Gaussianity. However, if the matter density 
is measured to be low, then a substantial increase in the 
quantity of cluster evolution will be required in order to 
significantly improve upon the limits derived here. The 
reason that constraints are weaker in the Q m — 0.3 case is 
that clusters at moderate redshifts (z ~ 0.9) are no longer 
particularly rare events, while the strongest constraints on 
Gaussianity come from probing the rarest tail of fluctua- 
tions. The best prospect for constraining non-Gaussianity 
in this case is to carry out deep surveys capable of detect- 
ing hot clusters (T > 15 keV) at high redshift (z > 1.5). 
Such surveys would be able to distinguish a non-Gaussian 
universe with G = 3 from a universe with Gaussian fluctu- 
ations if the area covered was of order 1000 square degrees. 
Conducting such surveys is a challenge which will deserve 
considerable attention if the matter density is indeed con- 
firmed to be low. 
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